# Low-resource Deployment

Deepseek Ai DeepSeek R1 Distill Qwen 14B GGUF
DeepSeek-R1-Distill-Qwen-14B is an optimized large language model with a parameter scale of 14B, released by DeepSeek AI. It is distilled from the Qwen architecture and offers multiple GGUF quantization versions to improve performance.
Large Language Model
D
featherless-ai-quants
237
1
Nvidia AceReason Nemotron 7B GGUF
Other
AceReason-Nemotron-7B is a large language model based on the Nemotron architecture with 7B parameters, offering multiple quantized versions to accommodate different hardware requirements.
Large Language Model
N
bartowski
209
2
Allura Org Q3 30B A3B Designant GGUF
A Llamacpp imatrix quantized version based on allura-org/Q3-30B-A3B-Designant, suitable for various quantization needs, supporting role-playing and conversational tasks.
Large Language Model
A
bartowski
344
1
AM Thinking V1 GGUF
Apache-2.0
AM-Thinking-v1 is a text generation model based on the GGUF format, suitable for various natural language processing tasks.
Large Language Model Transformers
A
Mungert
1,234
1
Primeintellect INTELLECT 2 GGUF
Apache-2.0
Quantized version of INTELLECT-2, optimized using llama.cpp, supporting multiple quantization types to accommodate different hardware requirements.
Large Language Model
P
bartowski
6,268
4
Andrewzh Absolute Zero Reasoner Coder 7b GGUF
Llamacpp quantized version based on andrewzh's Absolute_Zero_Reasoner-Coder-7b model, supporting multiple quantization levels, suitable for reasoning and code generation tasks.
Large Language Model
A
bartowski
1,325
5
Nvidia OpenCodeReasoning Nemotron 14B GGUF
Apache-2.0
This is the Llamacpp imatrix quantized version of the NVIDIA OpenCodeReasoning-Nemotron-14B model, suitable for code reasoning tasks.
Large Language Model Supports Multiple Languages
N
bartowski
1,771
2
Parakeet Tdt 0.6b V2 Onnx
NVIDIA Parakeet TDT 0.6B V2 is a model based on automatic speech recognition (ASR) tasks, suitable for English speech-to-text tasks.
Speech Recognition English
P
istupakov
129
3
Goekdeniz Guelmez Josiefied Qwen3 8B Abliterated V1 GGUF
This is a quantized version of the Qwen3-8B model, using llama.cpp for iMatrix quantization, suitable for chat scenarios.
Large Language Model
G
bartowski
7,520
12
Medra GGUF
Apache-2.0
Medra is a medical domain-specific QA and summarization model supporting English and Romanian, designed for medical AI applications.
Large Language Model Supports Multiple Languages
M
mradermacher
195
0
Allura Org Remnant Glm4 32b GGUF
Apache-2.0
Remnant-GLM4-32B is a 32B-parameter large language model based on the GLM4 architecture, supporting role-playing and conversational interactions, particularly suitable for salamander-related applications.
Large Language Model
A
bartowski
2,198
2
Mlabonne Qwen3 8B Abliterated GGUF
This is the quantized version of the Qwen3-8B-abliterated model, quantized using llama.cpp, suitable for text generation tasks.
Large Language Model
M
bartowski
6,892
5
Deepthink 1.5B Open PRM Q8 0 GGUF
Apache-2.0
Deepthink-1.5B-Open-PRM is a 1.5B parameter open-source language model, converted to GGUF format for use with llama.cpp.
Large Language Model English
D
prithivMLmods
46
2
Mistral Community Pixtral 12b GGUF
Apache-2.0
This is the quantized version of the pixtral-12b model, quantized using llama.cpp, supporting image-text-to-text tasks.
M
bartowski
1,728
4
Gemma 2 9b It Abliterated GGUF
A quantized version based on Gemma 2.9B, optimized using llama.cpp, suitable for running in LM Studio.
Large Language Model English
G
bartowski
3,941
37
Qwen2.5 1.5B Instruct GGUF
Apache-2.0
Qwen2.5 is the latest series of the Qwen large language model, featuring a 1.5B parameter instruction-tuned model that supports multilingual and long text generation.
Large Language Model English
Q
Mungert
556
4
Llama OuteTTS 1.0 1B Bf16
This is a text-to-speech model based on the MLX format, supporting multiple languages and suitable for speech synthesis tasks.
Speech Synthesis Supports Multiple Languages
L
mlx-community
23
0
Deepcoder 1.5B Preview AWQ
MIT
DeepCoder-1.5B-Preview is a large language model for code reasoning, fine-tuned from DeepSeek-R1-Distilled-Qwen-1.5B through distributed reinforcement learning, capable of handling longer context lengths.
Large Language Model Transformers English
D
adriabama06
72
2
Slim Orpheus 3b JAPANESE Ft Q8 0 GGUF
Apache-2.0
This is a GGUF format model converted from the slim-orpheus-3b-JAPANESE-ft model, specifically optimized for Japanese text processing.
Large Language Model Japanese
S
Gapeleon
26
0
Phi 4 Reasoning
MIT
Phi-4 Reasoning is a cutting-edge open-weight reasoning model based on Phi-4, fine-tuned with supervised chain-of-thought trajectory datasets and trained via reinforcement learning, specializing in mathematics, science, and programming skills.
Large Language Model Transformers Supports Multiple Languages
P
microsoft
11.31k
172
Minimaid L2
Apache-2.0
MiniMaid-L2 is a role-play specialized model further optimized from MiniMaid-L1, achieving outstanding performance among 3B-scale models through knowledge distillation and training on a larger dataset.
Large Language Model Transformers English
M
N-Bot-Int
63
2
Open Thoughts OpenThinker2 7B GGUF
Apache-2.0
Quantized version of OpenThinker2-7B, using llama.cpp for quantization, suitable for text generation tasks.
Large Language Model
O
bartowski
1,023
5
Orpheus 3b 0.1 Ft Q2 K.gguf
Apache-2.0
This model is a GGUF format conversion of canopylabs/orpheus-3b-0.1-ft, suitable for text generation tasks.
Large Language Model English
O
athenasaurav
25
0
Orpheus 3b 0.1 Ft Q4 K M GGUF
Apache-2.0
This model is a GGUF-format conversion of canopylabs/orpheus-3b-0.1-ft, suitable for text generation tasks.
Large Language Model English
O
athenasaurav
162
0
Gemma 2 2b It Tool Think
MIT
Text generation model fine-tuned based on google/gemma-2b-it, supporting tool call reasoning process
Large Language Model Transformers
G
langdai
36
2
Gemma 3 12b It Int4 Gguf
Gemma 3 is a lightweight multimodal open model from Google that supports text and image inputs with text outputs, featuring a 128K large context window and support for 140+ languages.
Image-to-Text
G
gaunernst
107
1
Orpheus Bangla GGUF
Apache-2.0
This is the static quantized version of the asif00/orpheus-bangla-tts model, supporting Bengali text-to-speech tasks.
Speech Synthesis Other
O
mradermacher
416
0
Orpheus 3b 0.1 Ft Q6 K GGUF
Apache-2.0
This is a GGUF format model converted from canopylabs/orpheus-3b-0.1-ft, suitable for text-to-speech tasks.
Large Language Model English
O
TheVisitorX
191
0
Orpheus 3b 0.1 Ft Q2 K GGUF
Apache-2.0
This is a GGUF format model converted from the canopylabs/orpheus-3b-0.1-ft model, suitable for text generation tasks.
Large Language Model English
O
Zetaphor
67
1
Qwen Encoder 0.5B GGUF
Apache-2.0
This is a statically quantized version of the knowledgator/Qwen-encoder-0.5B model, primarily designed for text encoding tasks.
Large Language Model English
Q
mradermacher
175
1
Llama 3.1 Nemotron Nano 8B V1 GGUF
Other
An 8B-parameter open-source large language model released by NVIDIA, based on the Llama-3 architecture, offering multiple quantization versions
Large Language Model English
L
tensorblock
1,048
4
Gemma 2 2b Jpn It Translate GGUF
A statically quantized model based on webbigdata/gemma-2-2b-jpn-it-translate, supporting translation tasks between Japanese and English.
Machine Translation Supports Multiple Languages
G
mradermacher
164
1
Beaverai MN 2407 DSK QwQify V0.1 12B GGUF
Apache-2.0
A large language model based on 12B parameters, supporting text generation tasks, released under the Apache-2.0 license.
Large Language Model
B
bartowski
1,547
5
Lightblue Reranker 0.5 Bin Filt Gguf
This is a text ranking model used for reordering and scoring texts to improve the relevance of search results.
Text Embedding
L
RichardErkhov
2,101
0
Gemma 3 12b It GGUF
Gemma 3 12B is a large language model that provides a quantized version in GGUF format, suitable for local deployment and use.
Large Language Model Transformers
G
tensorblock
336
1
Jbaron34 Qwen2.5 0.5b Bebop Reranker Newer Small Gguf
A 50-million-parameter text reranking model based on the Qwen2.5 architecture, suitable for information retrieval and document ranking tasks
Large Language Model
J
RichardErkhov
2,117
0
Gemma 3 4b Pt Qat Q4 0 Gguf
Gemma 3 is a lightweight open model series launched by Google, built on the same technology as Gemini, supporting multimodal input and text output.
Image-to-Text
G
google
912
16
Open R1 OlympicCoder 7B GGUF
Apache-2.0
OlympicCoder-7B is a 7B-parameter large language model focused on code generation, based on open-r1/OlympicCoder-7B with llama.cpp quantization, supporting multiple quantization level options.
Large Language Model English
O
bartowski
5,859
9
Financeconnect 13B I1 GGUF
Apache-2.0
FinanceConnect-13B is a 13B-parameter large language model specialized in the financial domain, supporting natural language processing tasks such as summarization, classification, and translation.
Large Language Model English
F
mradermacher
261
1
Whisper Large V3.w4a16
Apache-2.0
This is the quantized version of openai/whisper-large-v3, employing INT4 weight quantization and FP16 activation quantization, suitable for vLLM inference.
Speech Recognition Transformers English
W
nm-testing
20
1
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase